Restructuring HMM states for speaker adaptation in Mandarin speech recognition

نویسندگان

Xianghua Xu

Qiang Guo

Jie Zhu

چکیده

With the tendency of posterior probability taken into account, a state-restructuring method is proposed based on confusions between HMM states. In the method, HMM state is restructured by sharing Gaussian components with its related states and the re-estimation of the increased-parameters, i.e., the inter-state weights, is derived under the EM framework. Experiments are performed on speaker-independent large vocabulary continuous Mandarin speech recognition. The results show the state-restructured systems outperform the baseline system and the combining with MLLR adaptation can lead to consistent and significant improvement on recognition accuracy over MLLR.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

In the EMIME project, we developed a mobile device that performs personalized speech-to-speech translation such that a user’s spoken input in one language is used to produce spoken output in another language, while continuing to sound like the user’s voice. We integrated two techniques into a single architecture: unsupervised adaptation for HMM-based TTS using word-based large-vocabulary contin...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

A Comparative Study of Several Incremental Adaptation Algorithms for Speaker Adaptation

We conduct a comparative study of five representative incremental HMM adaptation algorithms developed in the past few years. We report the experimental results of using these algorithms for on-line speaker adaptation in a continuous Mandarin Chinese speech recognition system. We identify the strength and weakness of individual algorithms and offer recommendations for practitioners to make intel...

متن کامل

Online hierarchical transformation of hidden Markov models for speech recognition

This paper proposes a novel framework of online hierarchical transformation of hidden Markov model (HMM) parameters for adaptive speech recognition. Our goal is to incrementally transform (or adapt) all the HMM parameters to a new acoustical environment even though most of HMM units are unseen in observed adaptation data. We establish a hierarchical tree of HMM units and apply the tree to dynam...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

Restructuring HMM states for speaker adaptation in Mandarin speech recognition

نویسندگان

چکیده

منابع مشابه

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A Comparative Study of Several Incremental Adaptation Algorithms for Speaker Adaptation

Online hierarchical transformation of hidden Markov models for speech recognition

عنوان ژورنال:

اشتراک گذاری